[Deterministic] Move paddle version batch invariant pkg to Fastdeploy #4763

littledgg · 2025-11-03T07:06:44Z

Motivation

Achieving batch invariance in the PaddlePaddle framework.
Batch invariance：https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/

想要跑通需要安装如下内容，paddle必须是比较新的(建议用最新的)

pip install --pre paddlepaddle-gpu -i https://www.paddlepaddle.org.cn/packages/nightly/cu129/
pip install triton

python tests/batch_invariant/test_batch_invariance.py

如果能看见Batch-Invariant Mode下均为0就代表正确

目前只有log_softmax算子尽管精心构造了输入数据，但是在原版实现似乎就已经具备批处理不变性了。

TODO：严格对齐API目前(mm和log_softmax还存在问题)，可以考虑把test case整合进一个文件,文件中列出的若干TODO

Modifications

Usage or Command

Accuracy Tests

Checklist

Add at least a tag in the PR title.
- Tag list: [[FDConfig],[APIServer],[Engine], [Scheduler], [PD Disaggregation], [Executor], [Graph Optimization], [Speculative Decoding], [RL], [Models], [Quantization], [Loader], [OP], [KVCache], [DataProcessor], [BugFix], [Docs], [CI], [Optimization], [Feature], [Benchmark], [Others], [XPU], [HPU], [GCU], [DCU], [Iluvatar], [Metax]]
- You can add new tags based on the PR content, but the semantics must be clear.
Format your code, run pre-commit before commit.
Add unit tests. Please write the reason in this PR if no unit tests.
Provide accuracy results.
If the current PR is submitting to the release branch, make sure the PR has been submitted to the develop branch, then cherry-pick it to the release branch with the [Cherry-Pick] PR tag.

paddle-bot · 2025-11-03T07:06:54Z

Thanks for your contribution!

gongshaotian · 2025-11-05T02:51:37Z

please format you code

littledgg · 2025-11-05T03:11:04Z

please format you code

done

Copilot

Pull Request Overview

This PR introduces batch-invariant implementations of key PaddlePaddle operations (mm, addmm, log_softmax, mean) using Triton kernels to achieve deterministic inference results regardless of batch size. The implementation is adapted from the batch_invariant_ops library and integrated into FastDeploy.

Adds custom Triton kernel implementations for deterministic matrix operations and reduction operations
Provides a context manager to toggle between standard and batch-invariant modes
Includes comprehensive test files demonstrating batch invariance for each operation

Reviewed Changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 18 comments.

Show a summary per file

File	Description
fastdeploy/model_executor/layers/batch_invariant_ops/batch_invariant_ops.py	Core implementation with Triton kernels for batch-invariant operations and mode switching functionality
fastdeploy/model_executor/layers/batch_invariant_ops/init.py	Module initialization exporting public API
tests/batch_invariant/test_batch_invariance_op_mm.py	Test suite for matrix multiplication batch invariance
tests/batch_invariant/test_batch_invariance_op_mean.py	Test suite for mean operation batch invariance
tests/batch_invariant/test_batch_invariance_op_logsoftmax.py	Test suite for log_softmax operation batch invariance
tests/batch_invariant/test_batch_invariance_op_addmm.py	Test suite for addmm operation batch invariance

fastdeploy/model_executor/layers/batch_invariant_ops/batch_invariant_ops.py

tests/batch_invariant/test_batch_invariance_op_mm.py

tests/batch_invariant/test_batch_invariance_op_mean.py

tests/batch_invariant/test_batch_invariance_op_logsoftmax.py

tests/batch_invariant/test_batch_invariance_op_addmm.py

tests/batch_invariant/test_batch_invariance_op_mm.py

…ariant_ops.py 存在于原版代码注释中的版本控制遗留的内容，确实应该去除 Co-authored-by: Copilot <[email protected]>

Co-authored-by: Copilot <[email protected]>

…ariant_ops.py Co-authored-by: Copilot <[email protected]>

…deter

Move batch invariant pkg to Fastdeploy

88dcea8

paddle-bot bot added the contributor External developers label Nov 3, 2025

gongshaotian marked this pull request as ready for review November 3, 2025 07:08

littledgg changed the title ~~[Deterministic] Move batch paddle version invariant pkg to Fastdeploy~~ [Deterministic] Move paddle version batch invariant pkg to Fastdeploy Nov 3, 2025

littledgg added 3 commits November 3, 2025 19:57

fix problem and pre-commit

77d804e

move test

9f97167

Change testcase to FD style

f3dab79

gongshaotian mentioned this pull request Nov 4, 2025

[Deterministic Inference] Support Deterministic Inference #4651

Open

22 tasks

Add testcase for log_softmax

02c328f

littledgg force-pushed the deter branch from 73427df to 02c328f Compare November 4, 2025 11:07

littledgg added 2 commits November 4, 2025 19:52

Add testcase for mean

1d2021c

Add testcase for addmm

471b075

gongshaotian added this to FastDeploy Deterministic Inference Nov 5, 2025

gongshaotian moved this to In Progress in FastDeploy Deterministic Inference Nov 5, 2025

gongshaotian self-assigned this Nov 5, 2025

gongshaotian added Deterministic and removed contributor External developers labels Nov 5, 2025

fix pre-commit

daf47cd

littledgg and others added 3 commits November 5, 2025 11:31

API check v0.9

c85f38b

move to layers and add comment about log_softmax

a573a54

Merge branch 'develop' into deter

e2b409d

Copilot AI review requested due to automatic review settings November 12, 2025 08:19

Copilot started reviewing on behalf of littledgg November 12, 2025 08:19 View session

Copilot finished reviewing on behalf of littledgg November 12, 2025 08:21

Copilot AI reviewed Nov 12, 2025

View reviewed changes

littledgg and others added 2 commits November 12, 2025 16:46

Update fastdeploy/model_executor/layers/batch_invariant_ops/batch_inv…

f25a8c7

…ariant_ops.py 存在于原版代码注释中的版本控制遗留的内容，确实应该去除 Co-authored-by: Copilot <[email protected]>

Update tests/batch_invariant/test_batch_invariance_op_mean.py

5b44576

Co-authored-by: Copilot <[email protected]>

littledgg and others added 7 commits November 12, 2025 16:56

Update tests/batch_invariant/test_batch_invariance_op_logsoftmax.py

2026212

Co-authored-by: Copilot <[email protected]>

Update fastdeploy/model_executor/layers/batch_invariant_ops/batch_inv…

48c0ed1

…ariant_ops.py Co-authored-by: Copilot <[email protected]>

change comment after copilot fix

99b8363

Merge branch 'develop' into deter

6d1dfd1

fix bug about addmm

36e8d86

Merge branch 'deter' of https://github.com/littledgg/FastDeploy into …

4f070ec

…deter

Merge branch 'develop' into deter

32372a4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Deterministic] Move paddle version batch invariant pkg to Fastdeploy #4763

[Deterministic] Move paddle version batch invariant pkg to Fastdeploy #4763

littledgg commented Nov 3, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Nov 3, 2025

Uh oh!

gongshaotian commented Nov 5, 2025

Uh oh!

littledgg commented Nov 5, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Deterministic] Move paddle version batch invariant pkg to Fastdeploy #4763

Are you sure you want to change the base?

[Deterministic] Move paddle version batch invariant pkg to Fastdeploy #4763

Conversation

littledgg commented Nov 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Usage or Command

Accuracy Tests

Checklist

Uh oh!

paddle-bot bot commented Nov 3, 2025

Uh oh!

gongshaotian commented Nov 5, 2025

Uh oh!

littledgg commented Nov 5, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

littledgg commented Nov 3, 2025 •

edited

Loading